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ABSTRACT 

The inducible transcription factor Egr-1, which rec- 
ognizes a 9-bp target DNA sequence via three zinc- 
finger domains, rapidly activates particular genes 
upon cellular stimuli such as neuronal signals and 
vascular stresses. Here, using the stopped-f low fluo- 
rescence method, we measured the target search ki- 
netics of the Egr-1 zinc-finger protein at various ionic 
strengths between 40 and 400 mM KCI and found the 
most efficient search at 150 mM KCI. We further in- 
vestigated the kinetics of intersegment transfer, dis- 
sociation, and sliding of this protein on DNA at dis- 
tinct concentrations of KCI. Our data suggest that 
Egr-1 's kinetic properties are well suited for efficient 
scanning of chromosomal DNA in vivo. Based on a 
newly developed theory, we analyzed the origin of 
the optimal search efficiency at physiological ionic 
strength. Target association is accelerated by non- 
specific binding to nearby sites and subsequent slid- 
ing to the target as well as by intersegment transfer. 
Although these effects are stronger at lower ionic 
strengths, such conditions also favor trapping of the 
protein at distant nonspecific sites, decelerating the 
target association. Our data demonstrate that Egr- 
1 achieves the optimal search at physiological ionic 
strength through a compromise between the posi- 
tive and negative impacts of nonspecific interactions 
with DNA. 

INTRODUCTION 

To perform their function, most transcription factors and 
DNA repair/modifying enzymes must stochastically scan 
DNA and locate their specific target sites in the vast pres- 



ence of nonspecific but structurally similar sites. Nonspe- 
cific interactions with DNA have both positive and nega- 
tive impacts on the kinetics of the target search by these 
proteins. Nonspecific sites near the target serve as an an- 
tenna that accelerates target location by the proteins via 
nonspecific association followed by sliding on DNA (1-10). 
At the same time, nonspecific sites distant from the target 
can trap the proteins and decelerate the target search pro- 
cess (6,1 1). This trapping effect can be particularly strong in 
the nucleus, where the DNA density is extremely high (~ 100 
mg/ml) (12). Although ~80% of chromosomal DNA is oc- 
cupied by nucleosomes, the total concentration of accessible 
DNA segments between nucleosomes (average linker length 
~56 bp) (13) is estimated to be ~0.5 mM. Because this 
concentration is far higher than the typical apparent affini- 
ties of sequence-specific DNA-binding proteins for nonspe- 
cific DNA, it is qualitatively obvious that nonspecific DNA 
effectively traps the proteins and decelerates their target 
search process. However, the attenuation of search efficiency 
due to the trapping effect and its relationship to the mecha- 
nisms of facilitated target location by these proteins are not 
well understood quantitatively. 

Here we address this issue for the inducible transcription 
factor Egr-1 (also known as Zif268), which recognizes the 
9-base-pair (bp) target sequences GCG(T/G)GGGCG via 
three zinc-finger domains (14,15). In the brain, Egr-1 is in- 
duced by synaptic signals and activates genes for long-term 
memory formation and consolidation (16,17). In the car- 
diovascular system, Egr-1 serves as a stress-inducible tran- 
scription factor that activates the genes to initiate defense 
against vascular stress and injury (18,19). Within its short 
lifetime (~ 0.5-1 h) (18), Egr-1 rapidly regulates the target 
genes, allowing the cells to quickly respond to stimuli. It 
is therefore important to understand how Egr-1 efficiently 
scans DNA. Our previous biophysical studies have revealed 
important features of DNA scanning by Egr-1 (20-23). The 
nuclear magnetic resonance (NMR) and stopped-flow fluo- 
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rescence studies demonstrated extremely efficient interseg- 
ment transfer of Egr-1 between nonspecific DNA duplexes 
(20,23). The NMR study showed that Egr-1 scans DNA 
via two conformational^ different states termed the search 
and recognition modes (23). In the search mode, one zinc 
finger is locally dissociated from DNA while the other two 
zinc fingers remain bound to DNA. Our computational and 
experimental studies have suggested that the search mode 
facilitates intersegment transfer via intermediates in which 
a protein molecule transiently bridges two DNA duplexes 
(21,23). 

In the current study, we measure the target search ki- 
netics as a function of ionic strength and demonstrate that 
Egr-1 exhibits its optimal search efficiency at physiological 
ionic strength. Varying the ionic strength influences the an- 
tenna and trapping effects in completely different manners 
and therefore allows us to quantitatively assess the inter- 
play between these effects in the target DNA search process. 
We also introduce a new theoretical method that explic- 
itly investigates the effect of non-specific interactions and 
intersegment transfer. Analytical expressions for the target 
search kinetics in terms of the enhancement factors of the 
antenna effect and intersegment transfer and the attenua- 
tion factor of the trapping effect are obtained. Our experi- 
mental data and the newly developed theory shed light on 
Egr-1 's properties that are well suited for efficient scanning 
of DNA in the nucleus. 

MATERIALS AND METHODS 

Protein and DNA 

In this study, we used a protein construct of the Egr-1 
DNA-binding domain comprising three zinc fingers 
(human Egr-1 residues 335^432). For the sake of sim- 
plicity, this construct is hereafter referred to as the Egr-1 
zinc-finger protein. The protein was expressed in Es- 
cherichia coli and purified as described in our previous 
work (20,22,23). For fluorescence experiments, 33-, 48-, 
63-, 88-, 113- and 143-bp probe DNA duplexes with a 
fluorescein amidite (FAM) attached to the 5 / -terminus 
were prepared using synthetic DNA strands and enzymatic 
reactions as described (20). The duplexes contain a single 
Egr-1 target site and share the terminal 33 bp: FAM- 
AGCGTGGGCGTACCGGTAACTATCGTCTTGAGT- 
y (the Egr-1 target is underlined). Their complete se- 
quences are given in (20). A 28-bp competitor DNA duplex 
(y-GTACCGATTGCAGATTCCGAACCTTCAG-3') 
containing neither specific nor semi-specific sites was 
prepared as previously described (20,23). 

Stopped-flow fluorescence assay of the target search kinetics 

The target search kinetics was measured at 20 °C using an 
ISS PC-1 spectroffuorometer equipped with an Applied 
Photophysics RX.2000 stopped-flow device as described 
(20). In this assay, the time courses of the FAM fluores- 
cence were monitored upon mixing two solutions: (i) the 
Egr-1 zinc-finger protein, and (ii) FAM-labeled probe DNA 
and 28-bp nonspecific competitor DNA. Hereafter, the to- 
tal concentrations of the probe DNA, protein and competi- 
tor DNA in the reaction mixture are represented by the sym- 



bols Aot, Ptou and C to t, respectively. All experimental and 
theoretical analyses in this study used the conditions Aot 
^ Ptot Qot- Because of these inequalities, the relevant 
second-order processes occur in a pseudo-first-order man- 
ner, which simplifies the kinetic analysis (24). The buffers 
used were 10 mM TrisHCl (pH 7.5), 200 nM ZnCl 2 and 40- 
400 mM KC1. Apparent pseudo-first-order rate constants 
^app were measured by mono-exponential fitting to the flu- 
orescence time-course data (20). The apparent second-order 
rate constants k a for the target association were determined 
from /c app data using various protein concentrations P tot , 
typically between 20 and 400 nM (see the Supplementary 
Materials). This range is biologically relevant, because west- 
ern blot and DNA association data for nuclear extracts (e.g. 
18,19) implies that the maximum level of nuclear Egr-1 in 
vivo is of order of 10 -9 -10 -7 M. To analyze sliding of Egr- 
1 on DNA, we measured the rate constants k a using 33-, 
48-, 63-, 83-, 113- and 143-bp probe DNA (Aot = 2.5 nM) 
and 28-bp competitor DNA (C tot = 2000 nM). To analyze 
intersegment transfer between two nonspecific sites on dis- 
tinct DNA duplexes, we measured the rate constants k a us- 
ing 113-bp probe DNA (Aot = 2.5 nM) and 28-bp com- 
petitor DNA at various concentrations (C tot = 1000-16 000 
nM) (20). 

Analytical expression for the target search kinetics 

To analyze the experimental kinetic data, we derived a new 
general analytical expression for the target search kinetics. 
The derivation, which is given in the Supplementary Mate- 
rials, was based on the backward master equations of first- 
passage probabilities for discrete-state stochastic models 
(25,26). In our previous study (20), we used the pseudo-first- 
order rate constant /c ap p from fluorescence time-course data 
along with its analytical form that assumes the conditions 
Aot ^ Ptot ^ Qot and quasi-equilibrium for nonspecific 
protein-DNA interactions prior to the equilibrium of target 
association. In this study, we used the following analytical 
form of the second-order rate constant k a for protein-target 
association without assuming quasi-equilibrium: 

k = S J_ 

a K d , N + (0L - S) Aot + 0MC tot ' r N 

The parameters in this equation are as follows: 

s= Xi + y\y~ L - y L ) (2) 

(1 y){y l ~ m + y w )(j 1+L_m + y m ~ L ) 
y = 1 + (1/2)A- 2 - yA-2 + (l/4)A-4 (3) 

X = /A^. (4) 

The parameter X is the effective sliding length in units of 
bp (1,26); Dj is the one-dimensional diffusion coefficient for 
sliding in units of bp 2 s _1 ; K^n is the equilibrium constant 
in molar units for each nonspecific site; L is the total number 
of binding sites on the probe DNA; Mis the total number of 
binding sites on the competitor DNA; and m is the position 
of the target (located at the mth site from the edge). The 
parameter 0 is the number of possible orientations for each 



Nucleic Acids Research, 2014, Vol 42, No. 11 7041 



nonspecific site. Due to the structural pseudo-C2 symmetry 
of double-stranded DNA, 0 = 2 for proteins that bind as 
a monomer, and 0 = 1 for symmetric dimers. When 0 = 2 
is used, the microscopic parameters (e.g. K^n, k 0 ff,N, ku >N ) 
are defined for each orientation (20). The parameter x N in 
Equation (4) is the mean lifetime of a nonspecific complex 
and is given by 

?n = [k Q ff,N + kj,N {0^Aot + 0MC tot }] -1 , (5) 

where /c 0 ff,7v is the dissociation rate constant for a nonspe- 
cific complex, and kn >N is the second-order rate constant 
for intersegment transfer (also known as direct transfer) be- 
tween nonspecific sites on two distinct DNA duplexes (20). 
Although intersegment transfer can occur between two dis- 
tant sites on the same molecule if the DNA length is signifi- 
cantly longer than the persistence length (i.e. ~ 1 50 bp), such 
intra-molecular intersegment transfer is not considered here 
because only relatively short (<150 bp) DNA duplexes are 
used in our experiments. The annotation of symbols with 
the subscript 'N' indicates that those symbols represent pa- 
rameters for 'nonspecific' interactions. As explained in the 
Supplementary Materials, the current analytical expression 
(Equations (l)-(5)) is more general and simpler than the 
previous expression, while the two expressions provide nu- 
merically equivalent results (e.g. Supplementary Figure S- 
I). In the Discussion section, we also present a physically 
clearer description of the k a constant in terms of the an- 
tenna and trapping effects. 

Determination of the parameters for translocation of protein 
on DNA 

The one-dimensional diffusion coefficient Dj and the slid- 
ing length A were determined from the length-dependent k a 
data by fitting with Equations (l)-(4). The total numbers of 
binding sites, L for the probe DNA and M for the competi- 
tor DNA, were set to A — B + 1 , where A is the DNA length 
in bp, and B is the number of DNA base pairs covered by 
a protein molecule. Based on structural information about 
specific and nonspecific DNA complexes of Egr-1 (15,23), 
B = 9 was used, as previously described (20). Based on the 
sequences of the probe DNA duplexes, the target position m 
= 2 was used. Values of k a as a function of L were given as 
input data, and the parameters D] and X were optimized as 
independent fitting parameters via nonlinear least-squares 
fitting, in which the sum of the squared differences between 
observed and calculated k a values was minimized. In this 
fitting, r N was calculated via Equation (4) with X and Dj 
being optimized. 

The rate constant kn,N for intersegment transfer and the 
rate constant /c 0 ff,7v for dissociation were determined from 
the dependence of k a data on competitor DNA concentra- 
tions by fitting with Equations (l)-(5). In this calculation, 
values of k a as a function of C tot were given as input data, 
and the parameters ku^ and k 0 ft tN were optimized as inde- 
pendent fitting parameters via nonlinear least-squares fit- 
ting. The experimentally obtained Dj coefficient was used 
in this fitting calculation. Simultaneous fitting to the in- 
dependence and Qot -dependence data was also conducted 
using Equations (l)-(5) via optimization of Dj, knjy, and 
k 0 ff,N as the fitting parameters. 



These fitting calculations require values of the K^n con- 
stant for a nonspecific site (see Equation (1)). Values of K^n 
constants used for nonspecific interactions at 40, 60, 80, 110 
and 150 mM KC1 were 1.4, 3.6, 4.6, 9.6 and 16 |xM, respec- 
tively, which were obtained from our previous experimental 
data (20) in conjunction with the counterion condensation 
theory (27,28). All nonlinear least-squares fitting calcula- 
tions were performed with the MATLAB software (Math- 
Works, Inc.). Some examples of our MATLAB scripts for 
the fitting calculations are given in the Supplementary Ma- 
terials. 

RESULTS 

Target association kinetics at various ionic strengths 

Using the stopped-flow fluorescence assay, we measured the 
target search kinetics for the Egr-1 zinc-finger protein. This 
assay uses three macromolecular components: the probe 
DNA, protein and nonspecific competitor DNA (Figure 
1A). The probe DNA duplexes contain a target site and 
a FAM moiety that is tethered to a position near the tar- 
get. The fluorescence from the FAM probe changes upon 
binding of the protein to the target site (20). Immediately 
after a solution of the protein is rapidly mixed with a so- 
lution containing the probe DNA and competitor DNA, 
we began monitoring the time courses of the FAM fluo- 
rescence intensity (some example data are shown in Figure 
1 A). The time courses of fluorescence intensity were mono- 
exponential. The percentage change from initial to final in- 
tensities was virtually the same for the time-course data ob- 
tained under solution conditions where complete saturation 
of binding to the target site is expected at equilibrium (e.g. 
those shown in Figure 1A). A smaller percentage change 
was observed under conditions where the saturation level 
at equilibrium is lower (e.g. at 400 mM KC1). Using the 
time-course data, the pseudo-first-order rate constants in 
s _1 units were determined at various concentrations of pro- 
tein, from which the apparent second-order rate constant k a 
in M _1 s _1 units was calculated (see Supplementary Figure 
S-III). Using this assay, we measured the apparent second- 
order rate constants k a for target association of Egr-1 with 
the 1 13-bp probe DNA in the presence of 2 (xM nonspecific 
28-bp competitor DNA at 40, 60, 80, 110, 150, 190, 230, 
300 and 400 mM KC1. Figure IB shows a logarithmic plot 
of the k a data as a function of KC1 concentration. We found 
that the k a constant depended strongly on ionic strength in 
a non-monotonic manner and ranged from 0.055 x 10 8 to 
1.98 x 10 8 M _1 s _1 . Interestingly, the Egr-1 zinc-finger pro- 
tein exhibited the fastest target search kinetics at 150 mM 
KC1. Because Mg 2+ ions are known to play important roles 
in some protein-DNA/RNA interactions, we examined the 
influence of Mg 2+ ions on the target search kinetics under 
the same ionic strength. Keeping 0.5 x [K + ] + 2 x [Mg 2+ ] + 
0.5 x [Cl~] = 150 mM, we measured the target association 
kinetics as a function of Mg 2+ concentration in a physiolog- 
ical range (0.5-5 mM). The presence of Mg 2+ ions caused 
a slight (up to 23%) increase in the rate constant k a with 
the maximum k a found at [Mg 2+ ] = 1 mM (Supplementary 
Figure S-IV). These results indicate that physiological ionic 
strength renders an optimal condition for target search by 
the Egr-1 zinc-finger protein. 
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Time(s) KCI (mM) 

Figure 1. Target search kinetics of the Egr-1 zinc-finger protein as a function of ionic strength. (A) Stopped-flow fluorescence assay of the target search 
kinetics. The fluorescence time-course data shown were obtained using 113-bp probe DNA (Aot = 2.5 nM), nonspecific 28-bp DNA (C to t = 2000 nM) 
and protein (P tot = 50 nM) at 60, 1 50 and 300 mM KCI. Concentrations P t ot and C to t were varied in other measurements. (B) Apparent second-order rate 
constants k a for target association of the Egr-1 zinc-finger protein in the presence of 2000 nM nonspecific 28-bp DNA. The experiment was performed 
using 1 13-bp probe DNA at 40, 60, 80, 1 10, 150, 190, 230, 300 and 400 mM KCI. Values of k a constant were determined from the pseudo-first-order rate 
constants for target association at different concentrations of the protein. The solid line shown in the plot is a best-fit curve obtained using Equations 
(l)-(5) and the counterion condensation theory for the kinetic and thermodynamic parameters (see the main text). The data are shown on a logarithmic 
scale for each axis. For this panel and for Figures 2 and 3, the error bars represent the standard error of the mean (SEM) estimated from 6-10 replicates. 
For data points with no error bars, the SEMs were smaller than the size of the symbols. 



Ionic-strength dependence of sliding 

The DNA-length dependence of the target association ki- 
netics provides information on sliding of proteins on DNA 
(20,29-33). Using the 33-, 48-, 63-, 88-, 113- and 143-bp 
probe DNA (Aot = 2.5 nM) and 28-bp nonspecific com- 
petitor DNA (Qot = 2000 nM), we measured the apparent 
rate constants k a for the Egr- 1 zinc-finger protein at various 
KCI concentrations (Figure 2A). The DNA-length depen- 
dence of k a constants for these probe DNA duplexes was al- 
most linear at 40 mM KCI, but more curved and asymptotic 
at higher ionic strengths. Based on the length-dependence 
data, we determined the one-dimensional diffusion coeffi- 
cient D\ (in bp 2 s _1 ; Figure 2B) and the effective sliding 
length k (in bp; Figure 2C) as a function of KCI concen- 
tration. Although some literature assumed that the Dj co- 
efficient for sliding should be independent of ionic strength 
(11,34-36), we observed significant ionic-strength depen- 
dence of the experimental Dj coefficient for the Egr-1 zinc- 
finger protein (Figure 2B). Our interpretation of this result 
is given in the Discussion section. We also found that the 
sliding length k depended strongly on ionic strength: for ex- 
ample, k = 122 ± 2 bp at 40 mM KCI, and k = 38 ± 3 
bp at 150 mM KCI. As noted previously (6,20), the DNA- 
length dependence of the target association kinetics should 
be an increasing function with an asymptote for lengths > 
2k. In fact, such asymptotic length dependence was clearly 
observed (Figure 2A). At ionic strengths as high as 300 mM 
KCI, no dependence on length was found for the 33-143 
bp DNA duplexes (e.g. see k a values in Supplementary Fig- 
ures S-IIIb and c). Although we cannot determine the slid- 
ing length k in such a case, the absence of length depen- 
dence qualitatively suggests a very short sliding length. At 
40-150 mM KCI, the length dependence of k a constants for 
the same set of DNA duplexes was strong enough due to 2k 
being comparable to the DNA lengths used in the experi- 
ments (Figure 2 A and B). 



Ionic-strength dependence of intersegment transfer and dis- 
sociation 

Intersegment transfer is the direct translocation of protein 
from one DNA duplex to another (without going through 
the intermediary of free protein) via an intermediate in 
which a protein molecule transiently bridges two DNA du- 
plexes. The DNA concentration dependence of the translo- 
cation kinetics allows us to distinguish intersegment trans- 
fer from translocation via dissociation and re-association 
(20,23,37^3). We varied the concentration of the 28-bp 
nonspecific competitor DNA between 0.5 and 16 jxM and 
measured the rate constant k a for the association of Egr-1 
to the target on the 1 13-bp probe DNA (Figure 3A). Using 
the DNA concentration-dependence data, we determined 
the rate constant ku,N for intersegment transfer between 
nonspecific sites of two DNA duplexes and the rate con- 
stant k 0 ff t N for the dissociation of nonspecific complexes as 
a function of KCI concentration (Figure 3B and C). 

A surprising finding from this analysis was that the ionic- 
strength dependence of k 0 ff )N is extremely strong; for ex- 
ample, k off , N = 0.20 ± 0.26 s -1 at 40 mM KCI and k oStN 
= 920 ± 240 s- 1 at 150 mM KCI. As shown in Figure 
3B, the ionic-strength dependence of k 0 ff )N clearly obeys 
the log-log linear relationship in accordance with the ki- 
netic theory by Lohman et al (44), which is based on the 
counterion condensation arguments (27,28). We found that 
the rate constant ku >N for intersegment transfer also de- 
pended strongly on the ionic strength, but to a lesser de- 
gree. Due to this difference, translocation via intersegment 
transfer is more efficient than translocation via dissocia- 
tion and re-association at relatively low ionic strengths (i.e. 
kn )N (j)MC tot > k 0 ff t tf), whereas the latter can be more effi- 
cient (i.e. kiT !N 4>MC tot < k 0 ff >N ) at higher ionic strengths. In 
fact, for the C tot -dependence data at low ionic strengths, the 
kinetic model with intersegment transfer gave a far better 
fit than the model without intersegment transfer, whereas 
the two models gave more similar results for the data at 
high ionic strengths (Figure 2A). At KCI concentrations sig- 
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Figure 2. Ionic-strength dependence of sliding. (A) Length dependence of k a constants at 40, 60, 110 and 150 mM. The horizontal axis represents DNA 
length in bp. Solid lines are the best-fit curves obtained using Equations (l)-(4). The parameter L, which represents the number of binding sites on the 
probe DNA, was set to (DNA length in bp) —8 (see the main text). The one-dimensional diffusion coefficient Dj and the effective sliding length X were 
optimized in the fitting calculations. Z>t 0 t = 2.5 nM and C to t = 2 |jlM were used in the measurements. (B) The one-dimensional diffusion coefficient D\ for 
sliding of the Egr- 1 zinc-finger protein as a function of KCI concentration. (C) The effective sliding length A, as a function of ionic strength. 
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Figure 3. Ionic-strength dependence of the dissociation and intersegment transfer of the Egr-1 zinc-finger protein. (A) Dependence of the k. dpv constant on 
the concentration Q 0 t of the 28-bp nonspecific DNA. Solid lines are the best-fit curves obtained using a model with intersegment transfer (i.e. Equations 
(l)-(5)), and dotted lines are the best-fit curves using a model without intersegment transfer (i.e. Equations (l)-(5) with 1cyt,n = 0). (B) The rate constant 
k 0 ff,N for dissociation from a nonspecific site as a function of KCI concentration. (C) The rate constant kn^ for intersegment transfer between two 
nonspecific DNA molecules as a function of KCI concentration. 



nificantly higher than 150 mM, translocation via dissoci- 
ation and re-association governs the C to t -dependence and 
conceals the influence of intersegment transfer, which pre- 
cluded us from further analyses. 

DISCUSSION 

Consideration on the ionic-strength dependent Dj coefficient 

Although some previous studies assumed that sliding of 
proteins on DNA should be virtually independent of ionic 
strength (1 1,34-36), our D] data for Egr-1 (Figure 2B) show 
significant dependence on ionic strength. Bonnet et al also 
showed similar results for the restriction enzyme EcoRV 



(45). One may interpret that ionic-strength dependence 
of the experimental D] data is influenced by 'hopping' 
(i.e. short-range correlated translocation via dissociation 
and re-association (1)). Because our kinetic model does 
not discriminate between short-range (i.e. hopping) and 
long-range translocations via dissociation followed by re- 
association, kinetics of dissociation in these processes are 
equally represented by the rate constant A: 0 fr,jv- The Dj mea- 
surement can be significantly affected by hopping if the in- 
trinsic dissociation rate constant for hopping is compara- 
ble to the first-order rate constant k s \ N for sliding from one 
site to an adjacent site. To achieve this condition, however, 
the intrinsic dissociation rate constant for hopping must be 
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> 1000-fold larger than the observed dissociation rate con- 
stants k 0 ff rN (see Figures 2B and 3B; note that the Dj co- 
efficient in bp 2 s _1 units is equivalent to k s \ f N in s _1 units 
(20)). Such extremely rapid dissociation only for hopping 
seems unrealistic, although we cannot completely exclude 
this possibility. Thus, it is unlikely that the ionic-strength 
dependence of the Dj data for Egr-1 arises from the influ- 
ence of hopping. 

A more reasonable interpretation of the ionic-strength 
dependent Dj data is that energy barriers of electrostatic na- 
ture are involved in the sliding process. This interpretation 
is consistent with results of some previous studies. For ex- 
ample, coarse-grained molecular dynamics simulations of 
the DNA search by proteins also indicated significant de- 
pendence of the Dj coefficient on ionic strengths (23,46,47). 
Besides, if sliding is barrierless diffusion (e.g. 'movement on 
an isopotential surface' as speculated by Winter et al. three 
decades ago (11)), then the Dj coefficient should be a rela- 
tively simple function of the hydrodynamic radius, as is the 
case for three-dimensional diffusion in solutions. Despite 
this expectation, data available in the literature for proteins 
of different sizes show that Dj is virtually independent of 
molecular size (e.g. 20,35,36,45,48,49). Furthermore, many 
proteins exhibit sliding on DNA far slower than expected 
from the Stokes-Einstein relation (as reviewed in (5)). These 
observations and our ionic-strength dependent Dj data can 
be explained if sliding involves electrostatic energy barriers, 
which could be relevant to breakage of ion pairs between 
protein and DNA (50). 



Egr-1 's kinetic properties are well suited for efficient scanning 
of DNA in vivo 

Our kinetic data show that the Egr-1 zinc-finger protein ex- 
hibits its optimal search efficiency at the physiological ionic 
strength (i.e. 150 mM KC1). In addition, two other find- 
ings in our current study suggest that Egr- 1 has character- 
istics that enable efficient scanning of DNA in the nuclei. 
First, the effective sliding length for the Egr-1 zinc-finger 
protein at 150 mM KC1 seems to be suitable for scanning 
accessible regions of chromosomal DNA. According to the- 
oretical work by Mirny et al. (6), proteins can scan DNA 
most efficiently when 2A is comparable to the lengths of the 
linker DNA between nucleosomes. In human cells, linker 
lengths are widely distributed with an average of ~56 bp 
(13). Because our data indicate that 2A is comparable to 
these lengths, the sliding and dissociation properties seem 
suitable for Egr-1 to scan linker DNA in vivo. Second, our 
data also indicate that the intersegment transfer of Egr-1 is 
very efficient at physiological ionic strength. The previous 
study using the coarse-grained molecular dynamics simula- 
tions suggested that Egr- 1 can conduct intersegment trans- 
fer between two DNA duplexes separated by 60 A (23). Be- 
cause this separation corresponds to the distance between 
two DNA ends of a nucleosome particle (51), intersegment 
transfer could allow Egr-1 to efficiently bypass nucleosome 
particles and continuously scan DNA (20,23). Thus, our 
data collectively suggest that the kinetic properties of Egr-1 
are well suited to efficiently scan DNA in vivo. 



Influences of the antenna and trapping effects and interseg- 
ment transfer 

Our data show that the target search by the Egr-1 zinc- 
finger protein is fastest at physiological ionic strength. What 
is the origin for the optimal kinetic efficiency at physiolog- 
ical ionic strength? To offer a theoretical explanation, we 
provide a general expression for the kinetics of target loca- 
tion by DNA-binding proteins. This is done by transform- 
ing Equation (1) into the following form, which shows how 
the apparent rate constant k a for the target association is re- 
lated to the intrinsic association rate constant k on>N (= k 0 ff >N 
/K^n) of each site: 

k a = prjSk 0 ^ N . (6) 
The parameters p and r) in this equation are as follows: 

P ~ K d , N + (0L - S) Act + 0MC tot 

V^(0^Aot + 0MC to t) rQ . 
>7 = 1 + j • (8) 

K 0 ff,N 

Equation (6) quantitatively explains how the target associa- 
tion kinetics is influenced by the antenna effect, the trapping 
effect, and intersegment transfer, which are represented by 
the parameters S, p, and rj, respectively. The parameter S 
given by Equation (2) corresponds to the effective size of 
the antenna. Nonspecific sites near the target on the same 
DNA serve as an antenna and make the target association 
S-fold faster. Because this is equivalent to S sites serving as 
the target, the term 0L - S in Equation (7) corresponds to 
the number of nonspecific sites outside the antenna on the 
target-containing DNA duplex, and (0L - S)D tot + 0MC to t 
corresponds to the net overall concentration of nonspecific 
sites. Therefore, the parameter p represents the fraction of 
protein molecules that are not trapped by any nonspecific 
sites during the target search process. If (0L - S)D tot < 
0MC tot (which is usually satisfied), then the parameter p be- 
comes independent of S, and the expression becomes equiv- 
alent to the expression given by Esadze and Iwahara under 
the assumption of quasi-equilibrium for nonspecific inter- 
actions (20) (see the Supplementary Materials). If the tar- 
get position is near the middle of the DNA duplex (i.e. m & 
L/2), the parameter S (Equation (2)) becomes independent 
of m and can be reduced to 

S«2Xtanh^V (9) 

which corresponds to the expression for the antenna effect 
given in previous literature (1,6) (see Supplementary Figure 
S-II). Equation (8) shows that intersegment transfer consid- 
erably enhances the target search kinetics (i.e. rj > 1) when 
and only when the pseudo-first-order rate constant for in- 
tersegment transfer is substantially larger than the dissoci- 
ation rate constant of nonspecific complexes. As indicated 
by Equation (6), the ratio of k a to k on ^ is given by pr)S, in 
which p represents an attenuation factor (0 < p < 1) due to 
the trapping effect, rj is an enhancement factor (rj > 1) due 
to intersegment transfer, and S is an enhancement factor (1 
< S < 2X) due to the antenna effect. 
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Figure 4. The attenuation factor p due to the trapping effect (red), the en- 
hancement factor r\ due to intersegment transfer (magenta) and the en- 
hancement factor S due to the antenna effect (blue) for the target associa- 
tion kinetics of the Egr-1 zinc-finger protein. These parameters were calcu- 
lated from the current experimental data on the rate constants /c 0 ff,7V and 
ku^, the one-dimensional diffusion coefficient Dj, and the dissociation 
constant K^n f° r the Egr-1 zinc-finger protein along with Equations (2), 
(7) and (8). The following conditions were used: Aot = 2.5 nM; L = 105 
sites; C to t = 2000 nM; M= 20 sites. Ionic-strength dependence represented 
by logk = alog[KCl] + b was assumed for /c 0 ff,iv, ^it,tv, Dj and K^n, and 
the parameters a and b were calculated from the current experimental data. 

Optimal target search by Egr-1 at physiological ionic 
strengths 

Based on the above analytical expression of k a , we can now 
explain the ionic-strength dependence of the target search 
kinetics in terms of the antenna effect, the trapping effect 
and intersegment transfer. Using the current experimental 
data, we calculated the parameters p, rj and S for the Egr- 
1 zinc-finger protein as shown in Figure 4. The parameters 
S and t] are decreasing functions of ionic strength, and the 
antenna effect and intersegment transfer substantially ac- 
celerate the target search kinetics at low ionic strengths. In 
contrast, the parameter p is an increasing function of ionic 
strength, and the trapping effect substantially decelerates 
the target search kinetics at low ionic strengths. As previ- 
ously described, the rate constants for electrostatically as- 
sisted macromolecular association (such as k on>N ) are typ- 
ically a decreasing function of ionic strength (52,53). The 
rate constant k a as the product prjSk on N (Equation (6)) is 
therefore maximized at a particular ionic strength as shown 
in Figure IB. Our data indicate that the Egr-1 zinc-finger 
protein achieves its optimal kinetic efficiency at physiologi- 
cal ionic strength through the compromise between the pos- 
itive and negative impacts of nonspecific interactions with 
DNA. 

CONCLUSIONS 

In this study, we have demonstrated the optimal search effi- 
ciency of the Egr-1 zinc-finger protein at physiological ionic 
strength and explained the ionic-strength dependence of the 
target search kinetics in terms of the antenna and trapping 
effects and intersegment transfer. Our results suggest that 
Egr-l's kinetic properties in the sliding, dissociation and 
intersegment transfer processes are well suited for efficient 
scanning of DNA in vivo. This study provides insight into 
how the inducible transcription factor Egr-1 rapidly acti- 
vates the genes in response to cellular stimuli such as synap- 
tic signals and vascular injury. This work also provides a 
theory that quantitatively explains the interplay between the 



antenna and trapping effects and intersegment transfer in 
the target DNA search process. A deeper understanding of 
this interplay could be helpful in the engineering of tran- 
scription factors and DNA repair/modifying enzymes. For 
example, our theory predicts that protein engineering that 
simply increases the affinity for DNA will enhance the trap- 
ping effect and possibly result in slow target location. Be- 
cause Egr-1 (Zif268) has been used as a scaffold for zinc- 
finger technology for artificial gene control, we expect that 
current and future studies of DNA scanning by Egr-1 at the 
molecular and atomic levels may facilitate the kinetic im- 
provement of artificial zinc-finger proteins. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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